Abstract

We prove that the complexity class QIP, which consists of all problems having quantum 
interactive proof systems, is contained in PSPACE. This containment is proved by applying a 
parallelized form of the matrix multiplicative weights update method to a class of semidefinite 
programs that captures the computational power of quantum interactive proofs. As the
containment of PSPACE in QIP follows immediately from the well-known equality IP = PSPACE,
the equality QIP = PSPACE follows.

1 Introduction

Efficient proof verification is a fundamental notion in computational complexity theory. The most
direct complexity-theoretic abstraction of efficient proof verification is represented by the com- 
plexity class NP, wherein a deterministic polynomial-time verification procedure decides whether 
a given polynomial-length proof string is valid for a given input. One cannot overstate the im- 
portance of this class and its presently unknown relationship to P, the class of problems solvable 
deterministically in polynomial time. This problem, which is known as the P versus NP problem, 
is one of the greatest of all unsolved problems in mathematics. 

In the early to mid 1980s, Babai [Bab85] and Goldwasser, Micali, and Rackoff [GMR85]
introduced a computational model that extends the notion of efficient proof verification to interactive
settings. (Journal versions of these papers appeared later as [BM88] and [GMR89].) In this model,
which is known as the interactive proof system model, a computationally bounded verifier interacts 
with a prover of unlimited computation power. The interaction comprises one or more rounds 
of communication between the prover and verifier, and the verifier may make use of randomly 
generated bits during the interaction. After the rounds of communication are finished, the verifier 
makes a decision to accept or reject based on the interaction. 

A decision problem A is said to have an interactive proof system if there exists a verifier, 
always assumed to run in polynomial time, that meets two conditions: the completeness condition
and the soundness condition. The completeness condition formalizes the requirement that true
statements can be proved, which in the present setting means that if an input string x is a yes- 
instance of A, then there exists a course of action for the prover that causes the verifier to accept 
with high probability. The soundness condition formalizes the requirement that false statements 
cannot be proved, meaning in this case that if an input string x is a no-instance of A, then the 
verifier will reject with high probability no matter what course of action the prover takes. One 
denotes by IP the collection of decision problems having interactive proof systems. (Here, and 
throughout the rest of the paper, we take the term problem to mean promise problem, and consider 
that all complexity classes to be discussed are classes of promise problems. Promise problems
were defined by Even, Selman and Yacobi [ESY84], and readers unfamiliar with them are referred
to the survey of Goldreich [Gol05].)

The expressive power of interactive proof systems was not initially known when they were 
first defined, but it was soon determined to coincide with PSPACE, the class of problems solvable 
deterministically in polynomial space. The containment IP ⊆ PSPACE, which is generally
attributed to Feldman [Fel86], is fairly straightforward, and readers not interested in proving this
fact for themselves can find a proof in [HO02]. Known proofs [LFKN92, Sha92, She92] of the
reverse containment PSPACE ⊆ IP, on the other hand, are not straightforward, and make essential
use of a technique commonly known as arithmetization. This technique involves the extension of
Boolean formulas to multivariate polynomials over large finite fields whose 0 and 1 elements are
taken to represent Boolean values. Through the use of randomness and polynomial interpolation, 
verifiers may be constructed for arbitrary PSPACE problems. 

Many variants of interactive proof systems have been studied, including public-coin interactive
proofs [Bab85, BM88, GS89], multi-prover interactive proofs [BOGKW88], zero-knowledge
interactive proofs [GMR89, GMW91], and competing-prover interactive proofs [FK97]. The present
paper is concerned with quantum interactive proof systems, which were first studied a decade after 
IP = PSPACE was proved [Wat99, KW00]. The fundamental notions of this model are the same as
those of classical interactive proof systems, except that the prover and verifier may now process 
and exchange quantum information. Similar to the classical case, several variants of quantum 
interactive proof systems have been studied (including those considered in [HKSZ08, KKMV09,
KM03, Kob08, MW05, Wat09]).

One of the most interesting aspects of quantum interactive proof systems, which distinguishes 
them from classical interactive proof systems (at least to the best of our current knowledge), is that 
they can be parallelized to three messages. That is, quantum interactive proof systems consisting 
of just three messages exchanged between the prover and verifier already have the full power of 
quantum interactive proofs having a polynomial number of messages [KW00]. Classical
interactive proofs are not known to hold this property, and if they do the polynomial-time hierarchy
collapses to the second level [BM88].

The complexity class QIP is defined as the class of decision problems having quantum inter- 
active proof systems. QIP trivially contains IP, as the ability of a verifier to process quantum 
information is never a hindrance — a quantum verifier can simulate a classical verifier, and a com- 
putationally unbounded prover can never use quantum information to an advantage against a 
verifier behaving classically. The inclusion PSPACE ⊆ QIP is therefore immediate. The best upper
bound on QIP known prior to the present paper was QIP ⊆ EXP, which was proved in [KW00]
through the use of semidefinite programming. The optimal probability with which a given verifier
can be made to accept in a quantum interactive proof system can be represented as an
exponential-size semidefinite program, and known polynomial-time algorithms for semidefinite programming
provide the required tool to prove the containment. It has been an open problem for the last decade
to establish more precise bounds on the class QIP. 

It was recently shown in the paper [JUW09] that QIP(2), the class of problems having 2-message
quantum interactive proof systems, is contained in PSPACE. That paper made use of a parallel 
algorithm, based on a method known as the matrix multiplicative weights update method, to ap- 
proximate optimal solutions for a class of semidefinite programs that represent the maximum 
acceptance probabilities for verifiers in two-message quantum interactive proofs. In this paper we 
extend this result to all of QIP, establishing the relationship QIP = PSPACE. Similar to [JUW09],
we use the matrix multiplicative weights update method, together with parallel methods for ma- 
trix computations. 

The multiplicative weights method is a framework for algorithm design having its origins in 
various fields, including learning theory, game theory, and optimization. Its matrix variant, as 
discussed in the survey paper [AHK05] and the PhD thesis of Kale [Kal07], gives an iterative
way to approximate the optimal value of semidefinite programs [AK07, WK06]. In addition to its
application in [JUW09], it was applied to quantum complexity in [JW09] to prove the containment
of the complexity class QRG(1) in PSPACE. The key strength of this method for these applications
is that it can be parallelized for some special classes of semidefinite programs. 

A key result that allows our technique to work for the entire class QIP is the characterization 
QIP = QMAM proved in [MW05]. This characterization, which is described in greater detail in
the next section, concerns a restricted notion of interactive proof systems known as Arthur-Merlin 
games. An Arthur-Merlin game is an interactive proof system wherein the verifier can only send 
uniformly generated random bits to the prover. Following Babai [Bab85], one refers to the verifier 
as Arthur and to the prover as Merlin in this setting. It is also typical to refer to the individual bits of 
Arthur's messages as coins, given that they are each uniformly generated like the flip of a fair coin.
The restriction that Arthur sends only uniformly generated bits to Merlin, and therefore does not 
have the option to base his messages on private information unknown to Merlin, would seem to 
limit the power of Arthur-Merlin games in comparison to ordinary interactive proof systems. But 
in fact this is known not to be the case, both for classical [GS89] and quantum [MW05] interactive
proof systems. In the quantum setting, this characterization admits a significant simplification in 
the semidefinite programs that capture the complexity of the class QIP. 

The remainder of this paper has the following organization. Section 2 includes background
information, notation, and other preliminary discussions that are relevant to the remainder of the
paper. Section 3 describes a semidefinite programming problem that captures the complexity of
the class QIP based on quantum Arthur-Merlin games, and Section 4 presents the main algorithm
that solves this problem. Finally, Section 5 discusses a parallel approximation to the algorithm
from Section 4 and explains how its properties lead to the containment QIP ⊆ PSPACE.



2 Preliminaries 

This section contains a summary of the notation and terminology on linear algebra, quantum in- 
formation, semidefinite programming, quantum Arthur-Merlin games, and bounded-depth cir- 
cuits that is used later in the paper. For the most part, these discussions are intended only to 
make clear the notation and terminology that we use, and not to provide introductions to these 
topics. We assume that the reader already has familiarity with complexity theory and quantum 
computing, and refer readers who are not to [AB09] and [NC00].

2.1 Linear algebra and quantum information

A quantum register refers to a collection of qubits, or more generally a finite-size component in a 
quantum computer. Every quantum register $\mathsf{V}$ has associated with it a finite, non-empty set $\Sigma$
of classical states and a complex vector space of the form $\mathcal{V} = \mathbb{C}^{\Sigma}$. We use the Dirac notation
$\{|a\rangle : a \in \Sigma\}$ to refer to the standard basis (or elementary unit vectors) in $\mathcal{V}$, and define the inner
product and Euclidean norm on $\mathcal{V}$ in the standard way. The set $\{\langle a| : a \in \Sigma\}$ consists of the
elements in the dual space of $\mathcal{V}$ that are in correspondence with the standard basis vectors.

For such a space $\mathcal{V}$, we write $\mathrm{L}(\mathcal{V})$ to denote the space of linear mappings, or operators, from
$\mathcal{V}$ to itself, which is identified with the set of square complex matrices indexed by $\Sigma$ in the usual way.
An inner product on $\mathrm{L}(\mathcal{V})$ is defined as

\[ \langle A, B \rangle = \operatorname{Tr}(A^{\ast} B), \]

where $A^{\ast}$ denotes the adjoint (or conjugate transpose) of $A$. The identity operator on $\mathcal{V}$ is denoted
$\mathbb{1}_{\mathcal{V}}$ (or just $\mathbb{1}$ when $\mathcal{V}$ is understood).

The following special types of operators are relevant to the paper: 

1. An operator $A \in \mathrm{L}(\mathcal{V})$ is Hermitian if $A = A^{\ast}$. The eigenvalues of a Hermitian operator are
always real, and for $m = \dim(\mathcal{V})$ we write

\[ \lambda_1(A) \geq \lambda_2(A) \geq \cdots \geq \lambda_m(A) \]

to denote the eigenvalues of $A$ sorted from largest to smallest.

2. An operator $P \in \mathrm{L}(\mathcal{V})$ is positive semidefinite if it is Hermitian and all of its eigenvalues are
nonnegative. The set of such operators is denoted $\mathrm{Pos}(\mathcal{V})$. The notation $P \geq 0$ also indicates
that $P$ is positive semidefinite, and more generally the notations $A \leq B$ and $B \geq A$ indicate
that $B - A \geq 0$ for Hermitian operators $A$ and $B$.

Every Hermitian operator $A$ can be expressed uniquely as $A = P - Q$ for positive semidefinite
operators $P$ and $Q$ satisfying $\langle P, Q \rangle = 0$. The operator $P$ is said to be the positive part of $A$,
while $Q$ is the negative part.

3. A positive semidefinite operator $P \in \mathrm{Pos}(\mathcal{V})$ is also said to be positive definite if all of its
eigenvalues are positive (which implies that $P$ must be invertible). The notation $P > 0$ also indicates
that $P$ is positive definite, and the notations $A < B$ and $B > A$ indicate that $B - A > 0$ for
Hermitian operators $A$ and $B$.

4. An operator $\rho \in \mathrm{Pos}(\mathcal{V})$ is a density operator if it is both positive semidefinite and has trace
equal to 1. The set of such operators is denoted $\mathrm{D}(\mathcal{V})$.

5. An operator $\Pi \in \mathrm{Pos}(\mathcal{V})$ is a projection if all of its eigenvalues are either 0 or 1.

A quantum state of a register $\mathsf{V}$ is a density operator $\rho \in \mathrm{D}(\mathcal{V})$, and a measurement on $\mathsf{V}$ is a
collection $\{P_b : b \in \Gamma\} \subseteq \mathrm{Pos}(\mathcal{V})$ satisfying

\[ \sum_{b \in \Gamma} P_b = \mathbb{1}_{\mathcal{V}}. \]

The set $\Gamma$ is the set of measurement outcomes, and when such a measurement is performed on $\mathsf{V}$
while it is in the state $\rho$, each outcome $b \in \Gamma$ occurs with probability $\langle P_b, \rho \rangle$.

The spectral norm of an operator $A \in \mathrm{L}(\mathcal{V})$ is defined as

\[ \|A\| = \max\{\|Av\| : v \in \mathcal{V},\ \|v\| = 1\}. \]

The spectral norm is sub-multiplicative, meaning that $\|AB\| \leq \|A\| \|B\|$ for all operators
$A, B \in \mathrm{L}(\mathcal{V})$, and it holds that $\|P\| = \lambda_1(P)$ for every positive semidefinite operator $P$. For any operator
$A \in \mathrm{L}(\mathcal{V})$, the exponential of $A$ is defined as

\[ \exp(A) = \mathbb{1} + A + A^2/2 + A^3/6 + \cdots \]

The Golden-Thompson Inequality (see Section IX.3 of [Bha97]) states that, for any two Hermitian
operators $A$ and $B$ on $\mathcal{V}$, we have

\[ \operatorname{Tr}[\exp(A + B)] \leq \operatorname{Tr}[\exp(A)\exp(B)]. \]

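The tensor product $\mathcal{V} \otimes \mathcal{W}$ of vector spaces $\mathcal{V} = \mathbb{C}^{\Sigma}$ and $\mathcal{W} = \mathbb{C}^{\Gamma}$ may be associated with the
space $\mathbb{C}^{\Sigma \times \Gamma}$, and the tensor product of operators $A \in \mathrm{L}(\mathcal{V})$ and $B \in \mathrm{L}(\mathcal{W})$ is then taken to be
the unique operator $A \otimes B \in \mathrm{L}(\mathcal{V} \otimes \mathcal{W})$ satisfying $(A \otimes B)(v \otimes w) = (Av) \otimes (Bw)$ for all $v \in \mathcal{V}$
and $w \in \mathcal{W}$. These notions may be associated with the usual Kronecker product of vectors and
matrices. For quantum registers $\mathsf{V}$ and $\mathsf{W}$, the space $\mathcal{V} \otimes \mathcal{W}$ is associated with the pair $(\mathsf{V}, \mathsf{W})$,
viewed as a single register. Tensor products involving three or more spaces are handled similarly.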

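For a given linear mapping of the form $\Phi : \mathrm{L}(\mathcal{V}) \to \mathrm{L}(\mathcal{W})$, one defines the adjoint mapping
$\Phi^{\ast} : \mathrm{L}(\mathcal{W}) \to \mathrm{L}(\mathcal{V})$ to be the unique linear mapping that satisfies

\[ \langle B, \Phi(A) \rangle = \langle \Phi^{\ast}(B), A \rangle \]

for all operators $A \in \mathrm{L}(\mathcal{V})$ and $B \in \mathrm{L}(\mathcal{W})$.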

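Finally, for spaces $\mathcal{V}$ and $\mathcal{W}$, one defines the partial trace $\operatorname{Tr}_{\mathcal{V}} : \mathrm{L}(\mathcal{V} \otimes \mathcal{W}) \to \mathrm{L}(\mathcal{W})$ to be the
unique linear mapping that satisfies $\operatorname{Tr}_{\mathcal{V}}(A \otimes B) = (\operatorname{Tr} A) B$ for all $A \in \mathrm{L}(\mathcal{V})$ and $B \in \mathrm{L}(\mathcal{W})$. A
similar notation is used for the partial trace $\operatorname{Tr}_{\mathcal{W}}$, or partial traces defined on three or more tensor
factors. When this notation is used, the spaces on which the trace is not taken are determined by
context. When a pair of registers $(\mathsf{V}, \mathsf{W})$ is viewed as a single register and has the quantum state
$\rho \in \mathrm{D}(\mathcal{V} \otimes \mathcal{W})$, one defines the state of $\mathsf{W}$ to be $\operatorname{Tr}_{\mathcal{V}}(\rho)$. In other words, the partial trace describes
the action of destroying, or simply ignoring, a given quantum register.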
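As a concrete illustration of the partial trace just defined, the following is a minimal NumPy sketch. It assumes the Kronecker-product identification mentioned above, with the first tensor factor indexing the coarser blocks; the function names `partial_trace_first` and `partial_trace_second` are illustrative and do not come from the paper.

```python
import numpy as np

def partial_trace_first(op, dim_v, dim_w):
    """Trace out the first factor V of an operator on V (x) W, returning an operator on W."""
    # View the matrix as a 4-index tensor t[i, k, j, l] with (i, k) row and (j, l) column indices.
    t = op.reshape(dim_v, dim_w, dim_v, dim_w)
    return np.einsum("ikil->kl", t)

def partial_trace_second(op, dim_v, dim_w):
    """Trace out the second factor W of an operator on V (x) W, returning an operator on V."""
    t = op.reshape(dim_v, dim_w, dim_v, dim_w)
    return np.einsum("ikjk->ij", t)

# Defining property on product operators: tracing out V from A (x) B leaves (Tr A) B.
a = np.array([[1.0, 2.0], [3.0, 4.0]])
b = np.arange(9.0).reshape(3, 3)
print(np.allclose(partial_trace_first(np.kron(a, b), 2, 3), np.trace(a) * b))
```

Linearity then extends the product-operator property to every operator on the tensor product space, which is why checking it on `np.kron(a, b)` is a reasonable sanity test.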



2.2 Semidefinite programming 

A semidefinite program over complex vector spaces $\mathcal{V}$ and $\mathcal{W}$ is a pair of optimization problems as
follows.

Primal problem
    maximize:    $\langle C, X \rangle$
    subject to:  $\Psi(X) \leq D$,
                 $X \in \mathrm{Pos}(\mathcal{V})$.

Dual problem
    minimize:    $\langle D, Y \rangle$
    subject to:  $\Psi^{\ast}(Y) \geq C$,
                 $Y \in \mathrm{Pos}(\mathcal{W})$.

Here, the operators $C \in \mathrm{L}(\mathcal{V})$ and $D \in \mathrm{L}(\mathcal{W})$ are Hermitian and $\Psi : \mathrm{L}(\mathcal{V}) \to \mathrm{L}(\mathcal{W})$ must be
a linear mapping that maps Hermitian operators to Hermitian operators. Readers familiar with
semidefinite programming will note that the above form of a semidefinite program is different
from the well-known standard form, but it is equivalent and better suited for this paper's needs.
The form given above is, in essence, the one that is typically followed for general conic
programming [BV04].

It is typical that semidefinite programs are stated in forms that do not explicitly describe $\Psi$, $C$
and $D$, and the same is true for the semidefinite programs we will consider. It is, however, routine
to put them into the above form.

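With the above optimization problems in mind, one defines the primal feasible set $\mathcal{P}$ and the
dual feasible set $\mathcal{D}$ as

\[ \mathcal{P} = \{X \in \mathrm{Pos}(\mathcal{V}) : \Psi(X) \leq D\}, \]

\[ \mathcal{D} = \{Y \in \mathrm{Pos}(\mathcal{W}) : \Psi^{\ast}(Y) \geq C\}. \]

Operators $X \in \mathcal{P}$ and $Y \in \mathcal{D}$ are also said to be primal feasible and dual feasible, respectively. The
functions $X \mapsto \langle C, X \rangle$ and $Y \mapsto \langle D, Y \rangle$ are called the primal and dual objective functions, and the
optimal values associated with the primal and dual problems are defined as

\[ \alpha = \sup_{X \in \mathcal{P}} \langle C, X \rangle \quad\text{and}\quad \beta = \inf_{Y \in \mathcal{D}} \langle D, Y \rangle. \]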

Semidefinite programs have associated with them a powerful theory of duality, which refers
to the special relationship between the primal and dual problems. The property of weak duality,
which holds for all semidefinite programs, states that $\alpha \leq \beta$. This property implies that every dual
feasible operator $Y \in \mathcal{D}$ provides an upper bound of $\langle D, Y \rangle$ on the value $\langle C, X \rangle$ that is achievable
over all choices of a primal feasible $X \in \mathcal{P}$, and likewise every primal feasible operator $X \in \mathcal{P}$
provides a lower bound of $\langle C, X \rangle$ on the value $\langle D, Y \rangle$ that is achievable over all choices of a dual
feasible $Y \in \mathcal{D}$.

It is not always the case that $\alpha = \beta$ for a given semidefinite program, but in most natural cases
it does hold. The situation in which $\alpha = \beta$ is known as strong duality, and several conditions have
been identified that imply strong duality. One such condition is strict dual feasibility: if $\alpha$ is finite
and there exists an operator $Y > 0$ such that $\Psi^{\ast}(Y) > C$, then $\alpha = \beta$. The symmetric condition of
strict primal feasibility also implies strong duality.

2.3 Single-coin quantum Arthur-Merlin games 

Quantum Arthur-Merlin games were proposed in [MW05] as a natural quantum variant of
classical Arthur-Merlin games. Here, one simply mimics the classical definition in requiring that
Arthur's messages to Merlin consist of uniformly generated random bits. Merlin's messages to 
Arthur, however, may be quantum; and after all of the messages have been exchanged Arthur is 
free to perform a quantum computation when deciding to accept or reject. 

Of particular interest to us are quantum Arthur-Merlin games in which three messages are 
exchanged, and where Arthur's only message consists of a single bit. In more precise terms, such 
an interaction takes the following form: 

1. Merlin sends a quantum register $\mathsf{W}$ to Arthur. Merlin is free to initialize this register to any
quantum state of his choice, and may entangle it with a register of his own if he chooses.

2. After receiving $\mathsf{W}$ from Merlin, Arthur chooses a bit $a \in \{0, 1\}$ uniformly at random. Merlin
learns the value of $a$.

3. Merlin sends Arthur a second quantum register $\mathsf{Y}$. He does this after step 2, so he has the
option to condition the state of $\mathsf{Y}$ upon the value of $a$. The register $\mathsf{Y}$ could, of course, be
entangled with $\mathsf{W}$ in any way that quantum information theory permits.

4. After receiving $\mathsf{Y}$, Arthur performs one of two binary-valued measurements, determined by
the value of the random bit $a$, on the pair $(\mathsf{W}, \mathsf{Y})$. The measurement outcome 1 is interpreted
as acceptance, while 0 is interpreted as rejection.

Arthur's measurements must of course be efficiently implementable. This notion is formalized
by requiring that the measurements are implementable by polynomial-time generated families of
quantum circuits, which naturally requires the registers $\mathsf{W}$ and $\mathsf{Y}$ to consist of a number of qubits
that is polynomial in the length of the input. Further details may be found in [MW05].

The result of [MW05] that we make use of is that every problem $A \in \mathrm{QIP}$ has a single-coin
Arthur-Merlin game as just described. The game is such that if $x$ is a yes-instance of the problem
$A$, then Arthur accepts with probability 1, whereas if the input $x$ is a no-instance of the
problem then Arthur accepts with probability at most $1/2 + \varepsilon$, for any desired constant $\varepsilon > 0$. (In
the construction given in [MW05], Arthur's measurements are always nontrivial projective
measurements. This implies that even for no-instance inputs, Merlin can cause Arthur to accept with
probability at least 1/2 by simply guessing in advance Arthur's random bit.)

2.4 Bounded-depth circuit complexity 

In the last section of the paper, we will require the definitions of two complexity classes based 
on bounded-depth circuit families: NC and NC(poly). It is convenient for us to define these as 
classes of functions rather than decision problems, and when we wish to view them as classes of 
decision problems we simply restrict our attention to binary-valued functions. The class NC
contains all functions computable by logarithmic-space uniform Boolean circuits of polylogarithmic
depth, and NC(poly) contains all functions that can be computed by polynomial-space uniform
families of Boolean circuits having polynomial depth. For decision problems it is known [Bor77]
that NC(poly) = PSPACE, and the proof of our main result will make use of this fact.

There are two fundamental properties of NC(poly) that we will take advantage of. The first 
is that functions in NC and NC(poly) compose well, and the second is that many computational 
problems involving matrices are in NC. In more precise terms, the first property is as follows. If 
F : {0,1}* -> {0,1}* is a function in NC(poly) and G : {0, 1}* -> {0, 1}* is a function in NC, 
then the composition G o F is also in NC(poly). This follows from the most straightforward way 
of composing the families of circuits that compute F and G. 

To discuss the second property, it will be helpful to make clear our assumptions concerning 
matrix computations. We will always assume that the matrices on which computations are per- 
formed have entries with rational real and imaginary parts, and that the rational numbers are 
represented as pairs of integers in binary notation. Unless it is explicitly noted otherwise, any 
other rational numbers involved in our computations will be represented in a similar way. 

With these assumptions in place, we first note that elementary matrix operations, including 
inverses and iterated sums and products of matrices, are known to be in NC. There is an extensive 
literature on this topic, and we refer the reader to von zur Gathen's survey [Gat93] for more details.
We also note that matrix exponentials and spectral decompositions can be approximated to high 
accuracy in NC. In more precise terms, the following two problems are in NC. 






Matrix exponentials

Input: An $n \times n$ matrix $M$, a positive rational number $\eta$, and an integer $k$ expressed in
unary notation (i.e., $1^k$).

Promise: $\|M\| \leq k$.

Output: An $n \times n$ matrix $X$ such that $\|\exp(M) - X\| < \eta$.

Spectral decompositions

Input: An $n \times n$ Hermitian matrix $H$ and a positive rational number $\eta$.
Output: An $n \times n$ unitary matrix $U$ and an $n \times n$ real diagonal matrix $\Lambda$ such that

\[ \|H - U \Lambda U^{\ast}\| < \eta. \]



The reader will note that in these problems, the description of the error parameter $\eta$ could require
as few as $O(\log(1/\eta))$ bits. This implies that highly accurate approximations, for instance where
$\eta = 2^{-n}$, are possible in NC. The fact that matrix exponentials can be approximated in NC follows
by truncating the series

\[ \exp(M) = \mathbb{1} + M + M^2/2 + M^3/6 + \cdots \]

to a number of terms linear in $k + \log(1/\eta)$. (From a numerical point of view this is not a very
good way to compute matrix exponentials [ML03], but it is arguably the simplest way to prove that
the stated problem is in NC.) The fact that spectral decompositions can be approximated in NC
follows from a composition of known facts: in NC one can compute characteristic polynomials and
null spaces of matrices, perform orthogonalizations of vectors, and approximate roots of integer
polynomials to high precision [Csa76, BGH82, BCP83, BOFKT86, Gat93, Nef94].
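The series truncation described above can be sketched as follows. This is only a sequential floating-point illustration of the truncation idea, not an NC circuit, and the function name `exp_taylor` and its `terms` parameter are assumptions introduced here; the text only states that a number of terms linear in $k + \log(1/\eta)$ suffices.

```python
import numpy as np

def exp_taylor(m, terms):
    """Approximate exp(m) by the first `terms` terms of 1 + m + m^2/2 + m^3/6 + ..."""
    n = m.shape[0]
    result = np.eye(n, dtype=complex)
    term = np.eye(n, dtype=complex)   # holds m^j / j! for the current j
    for j in range(1, terms):
        term = term @ m / j
        result = result + term
    return result

# For a diagonal matrix the exact exponential is known entrywise, so the
# truncation error can be inspected directly.
m = np.diag([1.0, -1.0])
approx = exp_taylor(m, 30)
exact = np.diag(np.exp([1.0, -1.0]))
print(np.linalg.norm(approx - exact, 2))
```

The remainder after truncation is bounded by the tail of the scalar series for $\exp(\|M\|)$, which is why the promise $\|M\| \leq k$ controls how many terms are needed.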



3 A semidefinite programming formulation of the problem 

Consider Arthur's verification procedure for a given single-coin QMAM protocol on a fixed input
string $x$. Arthur first receives a register $\mathsf{W}$, then generates a random bit $a \in \{0, 1\}$, and then
receives a second register $\mathsf{Y}$. He then measures $(\mathsf{W}, \mathsf{Y})$ with respect to a binary-valued measurement

\[ \{P_a, \mathbb{1} - P_a\} \subset \mathrm{Pos}(\mathcal{W} \otimes \mathcal{Y}), \]

where we take each of the operators $P_0$ and $P_1$ to represent acceptance and $\mathbb{1} - P_0$ and $\mathbb{1} - P_1$ to
represent rejection. If the quantum state of $(\mathsf{W}, \mathsf{Y})$ is given by a density operator $\rho \in \mathrm{D}(\mathcal{W} \otimes \mathcal{Y})$
when Arthur measures, he will therefore accept with probability $\langle P_a, \rho \rangle$.
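Now define

\[ Q = \frac{1}{2} |0\rangle\langle 0| \otimes P_0 + \frac{1}{2} |1\rangle\langle 1| \otimes P_1 \in \mathrm{Pos}(\mathcal{X} \otimes \mathcal{W} \otimes \mathcal{Y}), \]

where we take $\mathcal{X} = \mathbb{C}^{\{0,1\}}$ to be the vector space corresponding to Arthur's random choice of
$a \in \{0, 1\}$, and consider the optimal probability that Merlin can cause Arthur to accept. If, for
each of the values $a \in \{0, 1\}$, Merlin is able to leave the state $\rho_a$ in the registers $(\mathsf{W}, \mathsf{Y})$ right before
Arthur measures, he will convince Arthur to accept with probability

\[ \frac{1}{2} \langle P_0, \rho_0 \rangle + \frac{1}{2} \langle P_1, \rho_1 \rangle = \langle Q, X \rangle \tag{1} \]

for

\[ X = |0\rangle\langle 0| \otimes \rho_0 + |1\rangle\langle 1| \otimes \rho_1. \]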
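The identity (1) can be checked numerically on random instances. The sketch below is an illustration only: the helper names and the random-instance generators are assumptions made here, and the states are drawn independently, so the check exercises the algebraic identity itself rather than any condition linking the two states.

```python
import numpy as np

def random_density(dim, rng):
    """Random density operator: positive semidefinite with trace 1."""
    g = rng.normal(size=(dim, dim)) + 1j * rng.normal(size=(dim, dim))
    rho = g @ g.conj().T
    return rho / np.trace(rho).real

def random_effect(dim, rng):
    """Random measurement operator P with eigenvalues in [0, 1]."""
    g = rng.normal(size=(dim, dim)) + 1j * rng.normal(size=(dim, dim))
    h = g @ g.conj().T
    return h / np.linalg.eigvalsh(h).max()

def inner(a, b):
    """Inner product <A, B> = Tr(A* B)."""
    return np.trace(a.conj().T @ b)

def both_sides_of_eq1(dim_wy=4, seed=0):
    """Return the left- and right-hand sides of (1) for a random instance."""
    rng = np.random.default_rng(seed)
    p0, p1 = random_effect(dim_wy, rng), random_effect(dim_wy, rng)
    rho0, rho1 = random_density(dim_wy, rng), random_density(dim_wy, rng)
    e0 = np.diag([1.0, 0.0])  # |0><0| on the coin space
    e1 = np.diag([0.0, 1.0])  # |1><1| on the coin space
    q = 0.5 * np.kron(e0, p0) + 0.5 * np.kron(e1, p1)
    x = np.kron(e0, rho0) + np.kron(e1, rho1)
    lhs = 0.5 * inner(p0, rho0) + 0.5 * inner(p1, rho1)
    rhs = inner(q, x)
    return lhs, rhs

lhs, rhs = both_sides_of_eq1()
print(abs(lhs - rhs))  # should be at the level of floating-point rounding
```

The cross terms vanish because $|0\rangle\langle 0|$ and $|1\rangle\langle 1|$ are orthogonal projections, which is the whole content of the identity.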

There is, of course, a constraint on Merlin's choice of $\rho_0$ and $\rho_1$, which is that they must agree on
$\mathsf{W}$, as Merlin cannot touch the register $\mathsf{W}$ at any point after Arthur chooses the random bit $a$. In
more precise terms, it must hold that

\[ \operatorname{Tr}_{\mathcal{Y}}(\rho_0) = \sigma = \operatorname{Tr}_{\mathcal{Y}}(\rho_1) \tag{2} \]

for some density operator $\sigma \in \mathrm{D}(\mathcal{W})$. This, in fact, is Merlin's only constraint: if he holds a
purification of the state $\sigma$, he is free to set the state of $(\mathsf{W}, \mathsf{Y})$ to any choice of $\rho_0$ and $\rho_1$ satisfying
(2) without needing access to $\mathsf{W}$.

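Now, we note that the condition (2) implies that

\[ \operatorname{Tr}_{\mathcal{Y}}(X) = \mathbb{1}_{\mathcal{X}} \otimes \sigma. \tag{3} \]

Moreover, for an arbitrary operator $X \in \mathrm{Pos}(\mathcal{X} \otimes \mathcal{W} \otimes \mathcal{Y})$ satisfying the constraint (3), one has
that the operators $\rho_0$ and $\rho_1$ defined as

\[ \rho_a = (\langle a| \otimes \mathbb{1}_{\mathcal{W} \otimes \mathcal{Y}}) \, X \, (|a\rangle \otimes \mathbb{1}_{\mathcal{W} \otimes \mathcal{Y}}) \]

for $a \in \{0, 1\}$ satisfy the conditions (1) and (2). It follows that the following semidefinite program
represents the optimal probability with which Merlin can convince Arthur to accept.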

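Primal problem
    maximize:    $\langle Q, X \rangle$
    subject to:  $\operatorname{Tr}_{\mathcal{Y}}(X) \leq \mathbb{1}_{\mathcal{X}} \otimes \sigma$,
                 $X \in \mathrm{Pos}(\mathcal{X} \otimes \mathcal{W} \otimes \mathcal{Y})$,
                 $\sigma \in \mathrm{D}(\mathcal{W})$.

Dual problem
    minimize:    $\|\operatorname{Tr}_{\mathcal{X}}(Y)\|$
    subject to:  $Y \otimes \mathbb{1}_{\mathcal{Y}} \geq Q$,
                 $Y \in \mathrm{Pos}(\mathcal{X} \otimes \mathcal{W})$.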

Note that the inequality in the primal problem can be exchanged for an equality without changing
the optimal value. This is because any primal feasible $X$ can be inflated to achieve the equality
$\operatorname{Tr}_{\mathcal{Y}}(X) = \mathbb{1}_{\mathcal{X}} \otimes \sigma$ for some choice of $\sigma$, and this can only increase the value of the objective function
by virtue of the fact that $Q$ is positive semidefinite. It is immediate that the optimal solution to the
primal problem is bounded and the dual problem is strictly feasible, from which strong duality
follows; the primal and dual problems have the same optimal values.

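Now, under the assumption that $Q$ is invertible, one may perform a change of variables to put
the above semidefinite program into a form that more closely resembles the one in [JUW09]. To
do this we define a linear mapping $\Phi : \mathrm{L}(\mathcal{X} \otimes \mathcal{W} \otimes \mathcal{Y}) \to \mathrm{L}(\mathcal{X} \otimes \mathcal{W})$ as

\[ \Phi(X) = \operatorname{Tr}_{\mathcal{Y}}\bigl(Q^{-1/2} X Q^{-1/2}\bigr), \tag{4} \]

whose adjoint mapping $\Phi^{\ast} : \mathrm{L}(\mathcal{X} \otimes \mathcal{W}) \to \mathrm{L}(\mathcal{X} \otimes \mathcal{W} \otimes \mathcal{Y})$ is given by

\[ \Phi^{\ast}(Y) = Q^{-1/2} (Y \otimes \mathbb{1}_{\mathcal{Y}}) Q^{-1/2}, \]

and consider the following semidefinite program.

Primal problem
    maximize:    $\operatorname{Tr}(X)$
    subject to:  $\Phi(X) \leq \mathbb{1}_{\mathcal{X}} \otimes \sigma$,
                 $X \in \mathrm{Pos}(\mathcal{X} \otimes \mathcal{W} \otimes \mathcal{Y})$,
                 $\sigma \in \mathrm{D}(\mathcal{W})$.

Dual problem
    minimize:    $\|\operatorname{Tr}_{\mathcal{X}}(Y)\|$
    subject to:  $\Phi^{\ast}(Y) \geq \mathbb{1}_{\mathcal{X} \otimes \mathcal{W} \otimes \mathcal{Y}}$,
                 $Y \in \mathrm{Pos}(\mathcal{X} \otimes \mathcal{W})$.
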

It is clear that this semidefinite program has the same optimal value as the previous one. 
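The adjoint relationship between the mapping in (4) and its stated adjoint can be sanity-checked numerically. The following sketch assumes that the last tensor factor is the one traced out and that the combined dimension of the first two factors is passed as `d_xw`; the names `phi`, `phi_adj`, `inv_sqrt` and `adjoint_gap` are illustrative choices made here.

```python
import numpy as np

def inv_sqrt(q):
    """Q^{-1/2} for a positive definite matrix Q, via its eigendecomposition."""
    vals, vecs = np.linalg.eigh(q)
    return (vecs * vals ** -0.5) @ vecs.conj().T

def phi(x, q, d_xw, d_y):
    """The mapping of (4): conjugate by Q^{-1/2}, then trace out the last factor."""
    r = inv_sqrt(q)
    t = (r @ x @ r).reshape(d_xw, d_y, d_xw, d_y)
    return np.einsum("ikjk->ij", t)

def phi_adj(y, q, d_y):
    """The adjoint mapping: tensor with the identity, then conjugate by Q^{-1/2}."""
    r = inv_sqrt(q)
    return r @ np.kron(y, np.eye(d_y)) @ r

def inner(a, b):
    """Inner product <A, B> = Tr(A* B)."""
    return np.trace(a.conj().T @ b)

def adjoint_gap(d_xw=4, d_y=2, seed=0):
    """Return |<Y, Phi(X)> - <Phi*(Y), X>| for a random instance."""
    rng = np.random.default_rng(seed)
    n = d_xw * d_y
    g = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    q = g @ g.conj().T + np.eye(n)  # positive definite, hence invertible
    x = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    y = rng.normal(size=(d_xw, d_xw)) + 1j * rng.normal(size=(d_xw, d_xw))
    return abs(inner(y, phi(x, q, d_xw, d_y)) - inner(phi_adj(y, q, d_y), x))

print(adjoint_gap())  # should be at the level of floating-point rounding
```

The same helper also makes the change of variables easy to explore: replacing a feasible operator of the first program by its conjugation with $Q^{1/2}$ turns the objective into a plain trace while the mapping of (4) undoes the conjugation.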

We will be interested in the optimal value of this semidefinite program in the case that $\|Q^{-1}\|$
is upper-bounded by a fixed constant and where there is a promise on the optimal value. The
promise, which will come from the properties of the quantum Arthur-Merlin games under
consideration, is that the optimal value does not lie in the interval (5/8, 7/8), and the goal is to
determine whether the optimal value is larger than 7/8 or smaller than 5/8.

For readers familiar with the semidefinite program for QIP(2) presented in [JUW09], we note
that there are two essential differences between it and the one above. The first difference is that
the semidefinite program in [JUW09] effectively replaces the density operator $\sigma$ with the scalar
value 1, which would seem to suggest added difficulty for the case at hand. The second difference
is that $\mathcal{X}$ is two-dimensional for the semidefinite program above, whereas it has arbitrary size in
[JUW09]. This second difference more than compensates for the difficulty induced by the first,
and we find that the above semidefinite program is actually much easier to solve than the one for
QIP(2).



4 The main algorithm and its analysis 

We now present the main algorithm for the semidefinite programming problem from the previous 
section. The algorithm, which is described in Figure 1, takes as input an operator

\[ Q \in \mathrm{Pos}(\mathcal{X} \otimes \mathcal{W} \otimes \mathcal{Y}). \]

It is assumed that $Q$ is invertible and satisfies $\|Q^{-1}\| \leq 64$. (The algorithm could easily be adapted
to handle any other fixed constant in place of 64, but this choice is sufficient for our needs.)
Moreover, it is assumed that the optimal value of the semidefinite program in Section 3 that is defined
by $Q$ does not lie in the interval (5/8, 7/8). Our goal is to prove that the algorithm accepts when
the optimal value is at least 7/8 and rejects when the optimal value is at most 5/8.

Here we present the correctness of the algorithm under the assumption that all computations 
are performed exactly. Issues that arise due to inaccuracies in the computation are discussed in 
the next section. 

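Assume first that the algorithm accepts, and write

\[ \rho = \rho_t, \quad \Pi = \Pi_t, \quad \xi = \xi_t \quad\text{and}\quad \beta = \beta_t \]

for $t \in \{0, \ldots, T - 1\}$ corresponding to the iteration in which acceptance occurs. For the sake of
clarity, let us note explicitly that

\[ \rho \in \mathrm{D}(\mathcal{X} \otimes \mathcal{W} \otimes \mathcal{Y}), \quad \Pi \in \mathrm{Pos}(\mathcal{X} \otimes \mathcal{W}) \quad\text{and}\quad \xi \in \mathrm{D}(\mathcal{W}). \]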

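We wish to prove that the optimal value of our semidefinite program is at least 7/8, and we will
do this by constructing a primal feasible solution that achieves an objective value strictly larger
than 5/8.

1. Let $N = \dim(\mathcal{X} \otimes \mathcal{W} \otimes \mathcal{Y})$ and $M = \dim(\mathcal{W})$, and define

\[ W_0 = \mathbb{1}_{\mathcal{X} \otimes \mathcal{W} \otimes \mathcal{Y}}, \quad \rho_0 = W_0 / N, \quad Z_0 = \mathbb{1}_{\mathcal{W}} \quad\text{and}\quad \xi_0 = Z_0 / M. \]

Also let

\[ \gamma = \frac{4}{3}, \quad \varepsilon = \frac{1}{64}, \quad \delta = \frac{\varepsilon}{\|Q^{-1}\|} \quad\text{and}\quad T = \left\lceil \frac{4 \log(N)}{\varepsilon^3 \delta} \right\rceil. \]

2. Repeat for each $t = 0, \ldots, T - 1$:

(a) Let $\Pi_t$ be the projection onto the positive eigenspaces of the operator

\[ \Phi(\rho_t) - \gamma \, \mathbb{1}_{\mathcal{X}} \otimes \xi_t, \]

where $\Phi$ is defined from $Q$ as in (4), and set $\beta_t = \langle \Pi_t, \Phi(\rho_t) \rangle$.

(b) If $\beta_t \leq \varepsilon$ then accept, else let

\[ W_{t+1} = \exp\Bigl(-\varepsilon\delta \sum_{j=0}^{t} \Phi^{\ast}(\Pi_j / \beta_j)\Bigr), \quad \rho_{t+1} = W_{t+1} / \operatorname{Tr}(W_{t+1}), \]

and

\[ Z_{t+1} = \exp\Bigl(\varepsilon\delta \sum_{j=0}^{t} \operatorname{Tr}_{\mathcal{X}}(\Pi_j / \beta_j)\Bigr), \quad \xi_{t+1} = Z_{t+1} / \operatorname{Tr}(Z_{t+1}). \]

3. If acceptance did not occur in step 2, then reject.

Figure 1: An algorithm that accepts if the optimal value of the semidefinite program in Section 3 is
larger than 7/8, and rejects if the optimal value is smaller than 5/8.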

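By the definition of $\Pi$, it holds that

\[ \Pi \Phi(\rho) \Pi \geq \Pi \bigl(\Phi(\rho) - \gamma \, \mathbb{1}_{\mathcal{X}} \otimes \xi\bigr) \Pi \geq \Phi(\rho) - \gamma \, \mathbb{1}_{\mathcal{X}} \otimes \xi, \tag{5} \]

and by Lemma 1 (which is stated and proved below) it holds that

\[ 2 \, \mathbb{1}_{\mathcal{X}} \otimes \operatorname{Tr}_{\mathcal{X}}(\Pi \Phi(\rho) \Pi) \geq \Pi \Phi(\rho) \Pi. \tag{6} \]

Combining the equations (5) and (6) one has

\[ \Phi(\rho) \leq \mathbb{1}_{\mathcal{X}} \otimes \bigl(\gamma \xi + 2 \operatorname{Tr}_{\mathcal{X}}(\Pi \Phi(\rho) \Pi)\bigr). \tag{7} \]

It therefore holds that

\[ X = \frac{\rho}{\gamma + 2 \langle \Pi, \Phi(\rho) \rangle} \quad\text{and}\quad \sigma = \frac{\gamma \xi + 2 \operatorname{Tr}_{\mathcal{X}}(\Pi \Phi(\rho) \Pi)}{\gamma + 2 \langle \Pi, \Phi(\rho) \rangle} \]

represent a feasible solution to the primal problem under consideration, achieving the objective
value

\[ \frac{1}{\gamma + 2 \langle \Pi, \Phi(\rho) \rangle} = \frac{1}{\gamma + 2\beta} \geq \frac{1}{\gamma + 2\varepsilon} > \frac{5}{8} \]

as required.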
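The projection step of the algorithm and the feasible solution constructed in the acceptance argument can be sketched numerically. This is a hedged illustration rather than the paper's parallel procedure: it uses a dense eigendecomposition, it takes the image of the current state under the mapping of (4) as a precomputed input `phi_rho`, it treats the threshold parameter as an argument `gamma`, and all function names are assumptions made here.

```python
import numpy as np

def partial_trace_x(op, d_w):
    """Trace out the two-dimensional coin space X from an operator on X (x) W."""
    t = op.reshape(2, d_w, 2, d_w)
    return np.einsum("ikil->kl", t)

def step_2a(phi_rho, xi, gamma):
    """Projection onto the positive eigenspaces of phi_rho - gamma * (1 (x) xi), and beta."""
    a = phi_rho - gamma * np.kron(np.eye(2), xi)
    vals, vecs = np.linalg.eigh(a)
    pos = vecs[:, vals > 0]              # eigenvectors with positive eigenvalue
    pi = pos @ pos.conj().T              # zero matrix if there are none
    beta = np.trace(pi @ phi_rho).real
    return pi, beta

def primal_sigma(phi_rho, xi, pi, beta, gamma):
    """The operator sigma paired with X = rho / (gamma + 2 beta) in the argument above."""
    d_w = xi.shape[0]
    return (gamma * xi + 2 * partial_trace_x(pi @ phi_rho @ pi, d_w)) / (gamma + 2 * beta)

def feasibility_gap(phi_rho, xi, gamma):
    """Smallest eigenvalue of 1 (x) sigma - Phi(X), together with Tr(sigma)."""
    pi, beta = step_2a(phi_rho, xi, gamma)
    sigma = primal_sigma(phi_rho, xi, pi, beta, gamma)
    slack = np.kron(np.eye(2), sigma) - phi_rho / (gamma + 2 * beta)
    return np.linalg.eigvalsh(slack).min(), np.trace(sigma).real

# Random positive semidefinite input standing in for the image of rho, and a random density xi.
rng = np.random.default_rng(0)
d_w = 3
g = rng.normal(size=(2 * d_w, 2 * d_w)) + 1j * rng.normal(size=(2 * d_w, 2 * d_w))
h = rng.normal(size=(d_w, d_w)) + 1j * rng.normal(size=(d_w, d_w))
xi = h @ h.conj().T
xi = xi / np.trace(xi).real
print(feasibility_gap(g @ g.conj().T, xi, 4 / 3))
```

A nonnegative smallest eigenvalue (up to rounding) and a trace equal to 1 correspond to the two primal constraints; the inequality relies on the coin space being two-dimensional, which is where the factor 2 in (6) comes from.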

Now assume that the algorithm rejects, and consider the operator

\[ Y = \frac{1 + 2\varepsilon}{T} \sum_{t=0}^{T-1} \Pi_t / \beta_t. \]

We claim that $Y$ is dual feasible and achieves an objective value that is strictly smaller than 7/8.
This will imply that the optimal value of the semidefinite program is at most 5/8.

Let us first prove that Y is dual feasible. It is clear that Y is positive semidefinite, so it suffices 
to prove that <t>*(Y) > tx®w®y> or equivalently that An(0*(Y)) > 1. Observe, for each t = 
0, . . . , T - 1, that 

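\begin{align*}
\operatorname{Tr}(W_{t+1}) &= \operatorname{Tr}\bigl[ \exp\bigl( -\varepsilon\delta\, \Phi^*(\Pi_0/\beta_0 + \cdots + \Pi_t/\beta_t) \bigr) \bigr] \\
&\le \operatorname{Tr}\bigl[ \exp\bigl( -\varepsilon\delta\, \Phi^*(\Pi_0/\beta_0 + \cdots + \Pi_{t-1}/\beta_{t-1}) \bigr) \exp\bigl( -\varepsilon\delta\, \Phi^*(\Pi_t/\beta_t) \bigr) \bigr] \\
&= \operatorname{Tr}\bigl[ W_t \exp\bigl( -\varepsilon\delta\, \Phi^*(\Pi_t/\beta_t) \bigr) \bigr]
\end{align*}

by the Golden-Thompson inequality. As each $\Pi_t$ is a projection operator, we have

\[ \|\Phi^*(\Pi_t)\| = \bigl\| Q^{-1/2} (\Pi_t \otimes \mathbb{1}_{\mathcal{Y}}) Q^{-1/2} \bigr\| \le \bigl\| Q^{-1/2} \bigr\|^2 = \bigl\| Q^{-1} \bigr\|, \]

where we have used the sub-multiplicativity of the spectral norm to obtain the inequality. Given
that $\beta_t > \varepsilon$ in the case at hand, it follows that $\|\delta\, \Phi^*(\Pi_t/\beta_t)\| < 1$. By Lemma 2 (also presented
below) it therefore follows that

\[ \exp\bigl( -\varepsilon\delta\, \Phi^*(\Pi_t/\beta_t) \bigr) \le \mathbb{1} - \varepsilon\delta \exp(-\varepsilon)\, \Phi^*(\Pi_t/\beta_t). \]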
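As each $W_t$ is positive semidefinite, we obtain

\[ \operatorname{Tr}(W_{t+1}) \le \operatorname{Tr}(W_t) \Bigl( 1 - \varepsilon\delta \exp(-\varepsilon) \Bigl\langle \frac{W_t}{\operatorname{Tr}(W_t)}, \Phi^*(\Pi_t/\beta_t) \Bigr\rangle \Bigr). \tag{8} \]

Substituting $\rho_t = W_t/\operatorname{Tr}(W_t)$ yields

\begin{align*}
\operatorname{Tr}(W_{t+1}) &\le \operatorname{Tr}(W_t) \bigl( 1 - \varepsilon\delta \exp(-\varepsilon) \langle \rho_t, \Phi^*(\Pi_t/\beta_t) \rangle \bigr) \\
&= \operatorname{Tr}(W_t) \bigl( 1 - \varepsilon\delta \exp(-\varepsilon) \bigr) \\
&\le \operatorname{Tr}(W_t) \exp\bigl( -\varepsilon\delta \exp(-\varepsilon) \bigr),
\end{align*}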

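where the equality follows from $\langle \rho_t, \Phi^*(\Pi_t) \rangle = \langle \Phi(\rho_t), \Pi_t \rangle = \beta_t$ and the last inequality follows
from the fact that $1 + z \le \exp(z)$ for all real numbers $z$. As $\operatorname{Tr}(W_0) = N$, it follows that

\[ \operatorname{Tr}(W_T) \le \operatorname{Tr}(W_0) \exp\bigl( -T\varepsilon\delta \exp(-\varepsilon) \bigr) = \exp\bigl( -T\varepsilon\delta \exp(-\varepsilon) + \log(N) \bigr). \tag{9} \]

On the other hand, we have

\[ \operatorname{Tr}(W_T) = \operatorname{Tr}\Bigl[ \exp\Bigl( -\varepsilon\delta \sum_{t=0}^{T-1} \Phi^*(\Pi_t/\beta_t) \Bigr) \Bigr] \ge \exp\Bigl( -\varepsilon\delta\, \lambda_N\Bigl( \Phi^*\Bigl( \sum_{t=0}^{T-1} \Pi_t/\beta_t \Bigr) \Bigr) \Bigr). \tag{10} \]

Combining (9) and (10), we have

\[ \lambda_N\Bigl( \Phi^*\Bigl( \sum_{t=0}^{T-1} \Pi_t/\beta_t \Bigr) \Bigr) \ge T \exp(-\varepsilon) - \frac{\log(N)}{\varepsilon\delta}. \]

Using the inequality $\exp(-\varepsilon) - \varepsilon^2/4 > 1 - \varepsilon$, and substituting the value of $T$ specified by the
algorithm, we have

\[ \lambda_N(\Phi^*(Y)) \ge (1 + 2\varepsilon) \Bigl( \exp(-\varepsilon) - \frac{\varepsilon^2}{4} \Bigr) > (1 + 2\varepsilon)(1 - \varepsilon) > 1 \]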

as required. 
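For readers who want the substitution of $T$ spelled out, the following short derivation may help. It is a sketch that assumes $Y = \frac{1+2\varepsilon}{T}\sum_{t=0}^{T-1}\Pi_t/\beta_t$ and $T = \lceil 4\log(N)/(\varepsilon^3\delta) \rceil$, as in Figure 1, and uses only the linearity of $\Phi^*$ and the bound obtained from (9) and (10).

```latex
\begin{align*}
\lambda_N(\Phi^*(Y))
  &= \frac{1+2\varepsilon}{T}\,
     \lambda_N\Bigl(\Phi^*\Bigl(\sum_{t=0}^{T-1}\Pi_t/\beta_t\Bigr)\Bigr)
   \ge (1+2\varepsilon)\Bigl(\exp(-\varepsilon)
       - \frac{\log(N)}{T\varepsilon\delta}\Bigr), \\
T \ge \frac{4\log(N)}{\varepsilon^{3}\delta}
  &\;\Longrightarrow\;
   \frac{\log(N)}{T\varepsilon\delta} \le \frac{\varepsilon^{2}}{4}.
\end{align*}
```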

Now it remains to establish an upper bound on the dual objective value achieved by Y. A 
similar method to the one used to prove the feasibility of Y above will provide a suitable bound. 
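We begin by observing, for each $t = 0, \ldots, T-1$, that

\begin{align*}
\operatorname{Tr}(Z_{t+1}) &= \operatorname{Tr}\bigl[ \exp\bigl( \varepsilon\delta \operatorname{Tr}_{\mathcal{X}}(\Pi_0/\beta_0 + \cdots + \Pi_t/\beta_t) \bigr) \bigr] \\
&\le \operatorname{Tr}\bigl[ \exp\bigl( \varepsilon\delta \operatorname{Tr}_{\mathcal{X}}(\Pi_0/\beta_0 + \cdots + \Pi_{t-1}/\beta_{t-1}) \bigr) \exp\bigl( \varepsilon\delta \operatorname{Tr}_{\mathcal{X}}(\Pi_t/\beta_t) \bigr) \bigr] \\
&= \operatorname{Tr}\bigl[ Z_t \exp\bigl( \varepsilon\delta \operatorname{Tr}_{\mathcal{X}}(\Pi_t/\beta_t) \bigr) \bigr].
\end{align*}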

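Given that

\[ \|\operatorname{Tr}_{\mathcal{X}}(\Pi_t)\| \le \bigl\| (\langle 0| \otimes \mathbb{1}_{\mathcal{W}}) \Pi_t (|0\rangle \otimes \mathbb{1}_{\mathcal{W}}) \bigr\| + \bigl\| (\langle 1| \otimes \mathbb{1}_{\mathcal{W}}) \Pi_t (|1\rangle \otimes \mathbb{1}_{\mathcal{W}}) \bigr\| \le 2, \]

and using the fact that $\beta_t > \varepsilon$ in the case at hand, it follows that $\|\delta \operatorname{Tr}_{\mathcal{X}}(\Pi_t/\beta_t)\| < 1$. We now
apply Lemma 2 to obtain

\[ \exp\bigl( \varepsilon\delta \operatorname{Tr}_{\mathcal{X}}(\Pi_t/\beta_t) \bigr) \le \mathbb{1} + \varepsilon\delta \exp(\varepsilon) \operatorname{Tr}_{\mathcal{X}}(\Pi_t/\beta_t). \]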

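As each $Z_t$ is positive semidefinite it follows that

\[ \operatorname{Tr}(Z_{t+1}) \le \operatorname{Tr}(Z_t) \Bigl( 1 + \varepsilon\delta \exp(\varepsilon) \Bigl\langle \frac{Z_t}{\operatorname{Tr}(Z_t)}, \operatorname{Tr}_{\mathcal{X}}(\Pi_t/\beta_t) \Bigr\rangle \Bigr). \tag{11} \]

Substituting $\xi_t = Z_t/\operatorname{Tr}(Z_t)$ gives

\[ \operatorname{Tr}(Z_{t+1}) \le \operatorname{Tr}(Z_t) \bigl( 1 + \varepsilon\delta \exp(\varepsilon) \langle \xi_t, \operatorname{Tr}_{\mathcal{X}}(\Pi_t/\beta_t) \rangle \bigr) = \operatorname{Tr}(Z_t) \bigl( 1 + \varepsilon\delta \exp(\varepsilon) \langle \mathbb{1}_{\mathcal{X}} \otimes \xi_t, \Pi_t/\beta_t \rangle \bigr). \]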

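Now, as $\langle \Phi(\rho_t) - \gamma\, \mathbb{1}_{\mathcal{X}} \otimes \xi_t, \Pi_t \rangle \ge 0$, we may again use the fact that $1 + z \le \exp(z)$ for all real
numbers $z$ to obtain

\[ \operatorname{Tr}(Z_{t+1}) \le \operatorname{Tr}(Z_t) \Bigl( 1 + \frac{\varepsilon\delta \exp(\varepsilon)}{\gamma} \langle \Phi(\rho_t), \Pi_t/\beta_t \rangle \Bigr) \le \operatorname{Tr}(Z_t) \exp\Bigl( \frac{\varepsilon\delta \exp(\varepsilon)}{\gamma} \Bigr). \]

Consequently

\[ \operatorname{Tr}(Z_T) \le \operatorname{Tr}(Z_0) \exp\Bigl( \frac{T\varepsilon\delta \exp(\varepsilon)}{\gamma} \Bigr) = \exp\Bigl( \frac{T\varepsilon\delta \exp(\varepsilon)}{\gamma} + \log(M) \Bigr). \tag{12} \]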



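On the other hand we have

\[ \operatorname{Tr}(Z_T) = \operatorname{Tr}\Bigl[ \exp\Bigl( \varepsilon\delta \sum_{t=0}^{T-1} \operatorname{Tr}_{\mathcal{X}}(\Pi_t/\beta_t) \Bigr) \Bigr] \ge \exp\Bigl( \varepsilon\delta\, \lambda_1\Bigl( \operatorname{Tr}_{\mathcal{X}}\Bigl( \sum_{t=0}^{T-1} \Pi_t/\beta_t \Bigr) \Bigr) \Bigr) \]

and therefore

\[ \lambda_1\Bigl( \operatorname{Tr}_{\mathcal{X}}\Bigl( \sum_{t=0}^{T-1} \Pi_t/\beta_t \Bigr) \Bigr) \le \frac{T \exp(\varepsilon)}{\gamma} + \frac{\log(M)}{\varepsilon\delta}. \]

Given that $M \le N$ it follows that

\[ \|\operatorname{Tr}_{\mathcal{X}}(Y)\| = \lambda_1(\operatorname{Tr}_{\mathcal{X}}(Y)) \le (1 + 2\varepsilon) \Bigl( \frac{\exp(\varepsilon)}{\gamma} + \frac{\log(N)}{T\varepsilon\delta} \Bigr) < \frac{7}{8}. \]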

Thus, $Y$ is a dual feasible solution whose objective value is smaller than 7/8, and we conclude that
the optimal value of our semidefinite program is at most 5/8 as required.
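The three numerical thresholds used in this analysis can be checked directly. The sketch below assumes the constants $\gamma = 4/3$ and $\varepsilon = 1/64$ from Figure 1 and bounds $\log(N)/(T\varepsilon\delta)$ by $\varepsilon^2/4$; the function names are illustrative only.

```python
import math
from fractions import Fraction

GAMMA = Fraction(4, 3)   # assumed value of gamma from Figure 1
EPS = Fraction(1, 64)    # assumed value of epsilon from Figure 1


def primal_value_bound(gamma=GAMMA, eps=EPS):
    """Exact lower bound 1/(gamma + 2*eps) on the primal objective when accepting."""
    return 1 / (gamma + 2 * eps)


def dual_feasibility_bound(eps=EPS):
    """(1 + 2 eps)(exp(-eps) - eps^2/4), the lower bound on lambda_N(Phi^*(Y))."""
    e = float(eps)
    return (1 + 2 * e) * (math.exp(-e) - e * e / 4)


def dual_value_bound(gamma=GAMMA, eps=EPS):
    """(1 + 2 eps)(exp(eps)/gamma + eps^2/4), the upper bound on ||Tr_X(Y)||."""
    e, g = float(eps), float(gamma)
    return (1 + 2 * e) * (math.exp(e) / g + e * e / 4)


if __name__ == "__main__":
    print(primal_value_bound() > Fraction(5, 8))   # primal value exceeds 5/8
    print(dual_feasibility_bound() > 1)            # Y is dual feasible
    print(dual_value_bound() < 7 / 8)              # dual value is below 7/8
```

With these constants the primal bound is exactly 96/131, which leaves a comfortable margin above 5/8.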

It remains to state and prove the two lemmas that were required in the analysis above. They 
are as follows. 

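Lemma 1. Let $P \in \mathrm{Pos}(\mathcal{X} \otimes \mathcal{Z})$ be any positive semidefinite operator, and assume that $\dim(\mathcal{X}) = 2$.
Then $P \le 2\, \mathbb{1}_{\mathcal{X}} \otimes \operatorname{Tr}_{\mathcal{X}}(P)$.

Proof. Let $\sigma_x$, $\sigma_y$ and $\sigma_z$ denote the Pauli operators on $\mathcal{X}$. In matrix form they are

\[ \sigma_x = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \quad \sigma_y = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix} \quad\text{and}\quad \sigma_z = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}. \]

As each of these operators is Hermitian, we have that $(\sigma_x \otimes \mathbb{1}_{\mathcal{Z}}) P (\sigma_x \otimes \mathbb{1}_{\mathcal{Z}})$, $(\sigma_y \otimes \mathbb{1}_{\mathcal{Z}}) P (\sigma_y \otimes \mathbb{1}_{\mathcal{Z}})$
and $(\sigma_z \otimes \mathbb{1}_{\mathcal{Z}}) P (\sigma_z \otimes \mathbb{1}_{\mathcal{Z}})$ are positive semidefinite. It therefore holds that

\[ 2\, \mathbb{1}_{\mathcal{X}} \otimes \operatorname{Tr}_{\mathcal{X}}(P) = P + (\sigma_x \otimes \mathbb{1}_{\mathcal{Z}}) P (\sigma_x \otimes \mathbb{1}_{\mathcal{Z}}) + (\sigma_y \otimes \mathbb{1}_{\mathcal{Z}}) P (\sigma_y \otimes \mathbb{1}_{\mathcal{Z}}) + (\sigma_z \otimes \mathbb{1}_{\mathcal{Z}}) P (\sigma_z \otimes \mathbb{1}_{\mathcal{Z}}) \ge P \]

as required. □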
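As a sanity check on Lemma 1, the Pauli identity and the resulting operator inequality can be tested numerically on a random positive semidefinite operator. This is an illustrative sketch only; the helper names and the choice of random test matrix are assumptions, not part of the proof.

```python
import numpy as np

# Pauli operators sigma_x, sigma_y, sigma_z acting on the qubit space X.
PAULIS = [
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
]


def partial_trace_first(P, d):
    """Trace out the leading qubit of an operator on C^2 (x) C^d."""
    return np.einsum("iaib->ab", P.reshape(2, d, 2, d))


def pauli_sum(P, d):
    """P plus its three conjugations by (sigma (x) identity)."""
    total = P.copy()
    for s in PAULIS:
        S = np.kron(s, np.eye(d))
        total += S @ P @ S
    return total


def random_psd(dim, rng):
    """A random positive semidefinite matrix of the form A A^dagger."""
    A = rng.normal(size=(dim, dim)) + 1j * rng.normal(size=(dim, dim))
    return A @ A.conj().T


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    d = 3
    P = random_psd(2 * d, rng)
    bound = 2 * np.kron(np.eye(2), partial_trace_first(P, d))
    print(np.allclose(pauli_sum(P, d), bound))        # the identity in the proof
    print(np.linalg.eigvalsh(bound - P).min() >= -1e-9)  # bound - P is PSD
```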

Lemma 2. Let $P$ be an operator satisfying $0 \le P \le \mathbb{1}$. Then for every real number $\eta > 0$, the following
two inequalities hold:

\begin{align*}
\exp(\eta P) &\le \mathbb{1} + \eta \exp(\eta) P, \\
\exp(-\eta P) &\le \mathbb{1} - \eta \exp(-\eta) P.
\end{align*}

Proof. It is sufficient to prove the inequalities for $P$ replaced by a scalar $\lambda \in [0, 1]$, for then the
operator inequalities follow by considering a spectral decomposition of $P$. If $\lambda = 0$ both inequalities
are immediate, so let us assume $\lambda > 0$. By the Mean Value Theorem there exists a value $\lambda_0 \in (0, \lambda)$
such that

\[ \frac{\exp(\eta\lambda) - 1}{\lambda} = \eta \exp(\eta\lambda_0) \le \eta \exp(\eta), \]

from which the first inequality follows. Similarly, there exists a value $\lambda_0 \in (0, \lambda)$ such that

\[ \frac{\exp(-\eta\lambda) - 1}{\lambda} = -\eta \exp(-\eta\lambda_0) \le -\eta \exp(-\eta), \]

which yields the second inequality. □
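Lemma 2 can likewise be checked numerically on random operators with spectrum in $[0, 1]$. The sketch below computes the smallest eigenvalue of each difference "right-hand side minus left-hand side"; both should be nonnegative up to rounding error. The helper names are hypothetical.

```python
import numpy as np


def herm_exp(A):
    """Matrix exponential of a Hermitian matrix via its spectral decomposition."""
    vals, vecs = np.linalg.eigh(A)
    return (vecs * np.exp(vals)) @ vecs.conj().T


def lemma2_gaps(P, eta):
    """Smallest eigenvalues of the two differences (upper bound minus exponential)."""
    I = np.eye(P.shape[0])
    gap_plus = I + eta * np.exp(eta) * P - herm_exp(eta * P)
    gap_minus = I - eta * np.exp(-eta) * P - herm_exp(-eta * P)
    return np.linalg.eigvalsh(gap_plus).min(), np.linalg.eigvalsh(gap_minus).min()


def random_contraction(dim, rng):
    """A random Hermitian operator P with 0 <= P <= 1."""
    A = rng.normal(size=(dim, dim)) + 1j * rng.normal(size=(dim, dim))
    U, _ = np.linalg.qr(A)                 # random unitary
    vals = rng.uniform(0.0, 1.0, size=dim)  # eigenvalues in [0, 1)
    return (U * vals) @ U.conj().T


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    P = random_contraction(5, rng)
    for eta in (1 / 64, 0.5, 2.0):
        print(eta, lemma2_gaps(P, eta))
```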



5 Proof that QIP is contained in PSPACE 



With the algorithm from the previous section in hand, the proof that QIP $\subseteq$ PSPACE follows the
same approach used in [JUW09] to prove QIP(2) $\subseteq$ PSPACE. The proof is described in the two
subsections that follow.

5.1 Simulation by bounded-depth Boolean circuits

Let $A = (A_{\mathrm{yes}}, A_{\mathrm{no}})$ be a promise problem in QIP. Our goal is to prove that $A \in$ PSPACE. Given
that PSPACE = NC(poly), as was mentioned in Section 2.4, it suffices to prove $A \in$ NC(poly).

Using Theorem 5.4 of [MW05], we have that there exists a single-coin QMAM-protocol for $A$
with perfect completeness and soundness probability $1/2 + \varepsilon$, for $\varepsilon = 1/64$. (Of course any other
sufficiently small positive constant would do, and in fact one can replace $\varepsilon$ with an exponentially
small value, but this choice is sufficient for our needs.) We will make a small modification in
Arthur's specification so that he always accepts outright with probability $4\varepsilon$, and otherwise
measures the registers sent by Merlin according to his original specification. With this modification in
place, we have that if $x \in A_{\mathrm{yes}}$, then Arthur can be made to accept with certainty, while if $x \in A_{\mathrm{no}}$
then the maximum probability with which Arthur can be made to accept is smaller than $1/2 + 3\varepsilon$.
It also holds that every strategy of Merlin causes Arthur to accept with probability at least $4\varepsilon$.
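The effect of this modification on the acceptance probability is simple arithmetic, sketched below with exact rationals. The sketch assumes the modified Arthur accepts with probability $4\varepsilon + (1 - 4\varepsilon)p$ when the original Arthur would accept with probability $p$; the function name is illustrative.

```python
from fractions import Fraction

EPS = Fraction(1, 64)


def modified_acceptance(p, eps=EPS):
    """Acceptance probability after Arthur accepts outright with probability 4*eps.

    With probability 4*eps Arthur accepts immediately; otherwise he behaves as
    before and accepts with the original probability p.
    """
    return 4 * eps + (1 - 4 * eps) * p


if __name__ == "__main__":
    print(modified_acceptance(Fraction(1)))                 # completeness stays 1
    print(modified_acceptance(Fraction(1, 2) + EPS))        # soundness case
    print(modified_acceptance(Fraction(0)))                 # never below 4*eps
```

For $p = 1/2 + \varepsilon$ the result is $1/2 + 3\varepsilon - 4\varepsilon^2$, which is strictly below $1/2 + 3\varepsilon$.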

Now, for any fixed choice of an input string $x \in A_{\mathrm{yes}} \cup A_{\mathrm{no}}$, let $Q$ be the operator defined from
this modified specification of Arthur on the input $x$ as was described in Section 3. Given that Arthur
always accepts with probability at least $4\varepsilon$, it follows that the smallest eigenvalue of $Q$ is at least
$2\varepsilon$. Therefore, $Q$ is invertible and satisfies $\|Q^{-1}\| \le 1/(2\varepsilon)$. Moreover, the semidefinite program
defined by $Q$, as described in Section 3, has an optimal value that is equal to 1 when $x \in A_{\mathrm{yes}}$ and
smaller than $1/2 + 3\varepsilon$ when $x \in A_{\mathrm{no}}$.

Next, consider a two-step computation as follows: 

1. Compute from a given input string x an explicit description of the operator Q specified above. 

2. Run an NC implementation of the algorithm from Section 4 on $Q$.

The first step of this computation can be performed in NC(poly) using an exact computation. This 
follows from the fact that in NC(poly) one can first compute explicit matrix representations of 
all of the gates in the quantum circuit specifying Arthur's measurements, and then process these
matrices using elementary matrix operations to obtain Q. Note that, without loss of generality, the 
description of Q has length polynomial in N, which (as defined in the algorithm) is the dimension 
of the space on which it acts. 

The second step of the computation, which is an NC implementation of the algorithm from
Section 4, is not quite as straightforward as the first step. In fact, it is only possible for us to
approximate this algorithm in NC, as we only know how to approximate the operator $Q^{-1/2}$, the
matrix exponentials, and the spectral decompositions needed to obtain the projection operators
$\Pi_0, \ldots, \Pi_{T-1}$. Nevertheless, we claim that such an approximation is possible in NC, with sufficient
accuracy to distinguish the two cases $x \in A_{\mathrm{yes}}$ and $x \in A_{\mathrm{no}}$. This fact is argued in the subsection
following this one.

Under the assumption that the second step is performed in NC, we have that the composition
of the two steps is an NC(poly) computation. We therefore obtain that $A \in$ NC(poly) as required.

5.2 A high precision NC implementation of the algorithm 

It remains to argue that the algorithm from Section 4 can be approximated by an NC computation
with sufficient accuracy to distinguish the cases $x \in A_{\mathrm{yes}}$ and $x \in A_{\mathrm{no}}$ as described above. It
will be evident from the discussion that follows that obtaining sufficient accuracy in NC is not
a significant challenge; and one could, in fact, demand much greater accuracy (by an order of
magnitude) and still be able to perform the computation in NC.

The first step in the implementation of the algorithm is to approximate $Q^{-1/2}$. In more precise
terms, we first compute an operator $R$ such that $R^2$ is a close approximation to $Q$, and then
compute $R^{-1}$ in NC using an exact computation. To compute $R$, we may compute a spectral
decomposition of $Q$, and then take $R$ to be the operator that results by replacing each eigenvalue
in this decomposition with its square root. It is straightforward to perform high-precision
approximations of these computations in NC with sufficient accuracy so that $\|Q - R^2\| < \varepsilon$ and
$\|R^{-1}\| < 1/\varepsilon$. Now, if we compare two semidefinite programs, one defined by $Q$ as specified in
Section 3 and the other defined similarly with $Q$ replaced by $R^2$, we find that the optimal values
are close. More specifically, given that $\|Q - R^2\| < \varepsilon$, the optimal values of the two semidefinite
programs can differ by at most $2\varepsilon$. Thus, the optimal value of the semidefinite program for $R^2$ is
at least $1 - 2\varepsilon > 7/8$ in case $x \in A_{\mathrm{yes}}$ and at most $1/2 + 5\varepsilon < 5/8$ in case $x \in A_{\mathrm{no}}$.
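The construction of $R$ described above can be sketched in floating point as follows. This is an illustration rather than the exact NC computation: the rounding to a fixed number of fractional bits stands in for choosing a rational $R$, the inverse is computed with ordinary floating-point linear algebra instead of exactly, and the function name and the default `bits=40` are assumptions.

```python
import numpy as np


def sqrt_approximation(Q, bits=40):
    """Return (R, R_inv, err) with R^2 close to Q and err = ||Q - R^2|| (spectral norm)."""
    vals, vecs = np.linalg.eigh(Q)
    R = (vecs * np.sqrt(vals)) @ vecs.conj().T   # replace each eigenvalue by its square root
    scale = 2.0 ** bits
    R = np.round(R * scale) / scale              # keep finitely many fractional bits
    R = (R + R.conj().T) / 2                     # restore exact Hermitian symmetry
    R_inv = np.linalg.inv(R)                     # floating-point stand-in for exact inversion
    err = np.linalg.norm(Q - R @ R, 2)
    return R, R_inv, err


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    U, _ = np.linalg.qr(rng.normal(size=(6, 6)))
    # Eigenvalues at least 2*eps = 1/32, matching the bound on Q in Section 5.1.
    Q = (U * rng.uniform(1 / 32, 1.0, size=6)) @ U.T
    R, R_inv, err = sqrt_approximation(Q)
    print(err < 1 / 64, np.linalg.norm(R_inv, 2) < 64)
```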

In the interest of clarity, to avoid introducing a new variable $R$ into the analysis that follows,
let us simply redefine $Q$ at this point to be $R^2$. Thus, $Q^{-1/2} = R^{-1}$ is known exactly by our
implementation of the algorithm and all of the requirements on $Q$ are in place. These requirements are that
$\|Q^{-1/2}\| < 1/\varepsilon = 64$ and that the optimal value of the semidefinite program in Section 3 defined by $Q$
is at least 7/8 if $x \in A_{\mathrm{yes}}$ and at most 5/8 if $x \in A_{\mathrm{no}}$.

Next, let us focus on the projection operators 

\[ \Pi_0, \ldots, \Pi_{T-1} \in \mathrm{Pos}(\mathcal{X} \otimes \mathcal{W}) \tag{13} \]

and the density operators

\[ \rho_0, \ldots, \rho_T \in \mathrm{D}(\mathcal{X} \otimes \mathcal{W} \otimes \mathcal{Y}) \quad\text{and}\quad \xi_0, \ldots, \xi_T \in \mathrm{D}(\mathcal{W}) \tag{14} \]

that are to be computed in the course of the algorithm. We will choose an integer $K$ that we
take to represent the number of bits of accuracy with which these operators are stored. In more
precise terms, the algorithm will store the real and imaginary parts of each of the entries of the
above operators (13) and (14) as integers divided by $2^K$. It will suffice to take $K = c\lceil \log(N) \rceil$,
for a suitable choice of a constant $c$, although one could in fact afford to take $K$ to be polynomial
in $N$ rather than logarithmic. As each entry of these operators has absolute value at most 1, the
total number of bits needed to represent the entire collection of operators is $O(TKN^2)$, which is
polynomial in $N$.
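The storage convention described here, in which real and imaginary parts are kept as integers divided by $2^K$, can be sketched as follows. The function names are hypothetical, and round-to-nearest is an assumption; the text only requires $K$ bits of accuracy.

```python
import numpy as np


def to_fixed_point(A, K):
    """Return integer arrays (re, im) such that A is close to (re + 1j*im) / 2**K."""
    scale = 2 ** K
    re = np.rint(np.real(A) * scale).astype(np.int64)
    im = np.rint(np.imag(A) * scale).astype(np.int64)
    return re, im


def from_fixed_point(re, im, K):
    """Rebuild the complex matrix represented by the integer pair (re, im)."""
    return (re + 1j * im) / 2 ** K


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    A = (rng.uniform(-1, 1, size=(4, 4)) + 1j * rng.uniform(-1, 1, size=(4, 4))) / 2
    K = 20
    B = from_fixed_point(*to_fixed_point(A, K), K)
    # Each real and imaginary part moves by at most half a unit in the last place.
    print(np.max(np.abs(np.real(B - A))) <= 2.0 ** -(K + 1) + 1e-15)
```

Since every entry has absolute value at most 1, each stored integer has magnitude at most $2^K$, which is the source of the $O(TKN^2)$ bit count in the text.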

In addition to the above operators, the algorithm will store the scalar values $\beta_0, \ldots, \beta_{T-1}$.
These values do not need to be approximated; each value $\beta_t$ is computed exactly as the rational
number defined by the operators $\rho_t$ and $\Pi_t$ stored by the algorithm. We will not consider that
the operators $W_1, \ldots, W_T$ and $Z_1, \ldots, Z_T$ are stored by the algorithm at all, as their only purpose
in the computation is to specify the density operators $\rho_1, \ldots, \rho_T$ and $\xi_1, \ldots, \xi_T$.

We will also take $\mu$ to be a small constant, say $\mu = 2^{-10}$, that will represent an error parameter
for the computation. Similar to the choice of $K$, we could afford to take $\mu$ to be significantly smaller
than this and still be able to perform the computation in NC.

Now, consider the two steps (a) and (b) that are performed within each iteration of the loop
in step 2 of the algorithm. We must approximate these steps, and we demand the following
accuracy requirements when doing this. For step (a), we will require that the projection operator $\Pi_t$
computed by the algorithm satisfies the condition

\[ \Pi_t \bigl( \Phi(\rho_t) - \gamma\, \mathbb{1}_{\mathcal{X}} \otimes \xi_t \bigr) \Pi_t \ge P_t - \frac{\mu}{M}\, \mathbb{1}_{\mathcal{X}} \otimes \mathbb{1}_{\mathcal{W}}, \tag{15} \]

where $P_t$ is defined as the positive part of $\Phi(\rho_t) - \gamma\, \mathbb{1}_{\mathcal{X}} \otimes \xi_t$. It is possible to perform such a
computation in NC by setting the error parameter $\eta$ in an approximate spectral decomposition
computation of $\Phi(\rho_t) - \gamma\, \mathbb{1}_{\mathcal{X}} \otimes \xi_t$ as $\eta = \mu/(2M)$, for instance. Then, $\Pi_t$ is taken to be the
appropriately defined projection operator rounded to $K$ bits of accuracy. For step (b), we will require
that 

||p t+ l-W t+1 /Tr(W m )|| < ^ and ||f m - Z t+1 / Tr(Z m ) || < ^. (16) 

In these inequalities we do not consider that $W_{t+1}$ and $Z_{t+1}$ are stored by the algorithm, but rather
we consider that they are operators defined by the equations

\[ W_{t+1} = \exp\Bigl( -\varepsilon\delta \sum_{j=0}^{t} \Phi^*(\Pi_j/\beta_j) \Bigr) \quad\text{and}\quad Z_{t+1} = \exp\Bigl( \varepsilon\delta \sum_{j=0}^{t} \operatorname{Tr}_{\mathcal{X}}(\Pi_j/\beta_j) \Bigr) \]

for the particular operators $\Pi_0/\beta_0, \ldots, \Pi_t/\beta_t$ that are stored by the algorithm. The algorithm's
approximations of $W_{t+1}$ and $Z_{t+1}$ determine the density operators $\rho_{t+1}$ and $\xi_{t+1}$. As the matrix
exponentials are to be computed for operators having norm bounded by $T = O(\log N)$, it is clear
that $\rho_{t+1}$ and $\xi_{t+1}$ with the required properties can be computed in NC.
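Since only the normalized operators $\rho_{t+1}$ and $\xi_{t+1}$ are needed, an implementation can compute $\exp(H)/\operatorname{Tr}(\exp(H))$ directly from a spectral decomposition of the Hermitian exponent $H$. The floating-point sketch below does this; shifting the eigenvalues by their maximum is a standard numerical-stability device that cancels in the ratio, and is an assumption of this sketch rather than something the text prescribes.

```python
import numpy as np


def normalized_exp(H):
    """Return exp(H) / Tr(exp(H)) for a Hermitian matrix H."""
    vals, vecs = np.linalg.eigh(H)
    weights = np.exp(vals - vals.max())   # the common shift cancels after normalizing
    weights /= weights.sum()
    return (vecs * weights) @ vecs.conj().T


if __name__ == "__main__":
    # exp(diag(0, log 3)) = diag(1, 3), which normalizes to diag(1/4, 3/4).
    print(normalized_exp(np.diag([0.0, np.log(3.0)])))
```

In the notation of the text, $\rho_{t+1}$ corresponds to `normalized_exp` applied to $-\varepsilon\delta \sum_{j \le t} \Phi^*(\Pi_j/\beta_j)$, and $\xi_{t+1}$ to the same function applied to $\varepsilon\delta \sum_{j \le t} \operatorname{Tr}_{\mathcal{X}}(\Pi_j/\beta_j)$.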

Finally, we have that the total number of iterations in the algorithm is $T = O(\log N)$. Given
that each of the iterations of the algorithm can be performed in NC, and that the total number 
of bits that must be stored from one iteration to the next is polynomial in N, we have that the 
composition of these T iterations can be performed in NC as well. 

It remains only to show that the approximations (15) and (16) are sufficient to guarantee that
the algorithm accepts or rejects correctly. This analysis is done in almost exactly the same way as
was presented in Section 4. Even though the operators

\[ \rho_0, \ldots, \rho_{T-1}, \quad \xi_0, \ldots, \xi_{T-1}, \quad\text{and}\quad \Pi_0/\beta_0, \ldots, \Pi_{T-1}/\beta_{T-1} \]

do not necessarily satisfy the precise equations that were assumed in Section 4, they may
nevertheless be used to construct primal and dual solutions to the semidefinite program that satisfy the
required bounds.

In the case that the algorithm accepts, a consideration of the operators $\rho = \rho_t$, $\Pi = \Pi_t$, and
$\xi = \xi_t$ as before allows for the construction of a primal feasible solution with a large objective
value. In place of (7), we have

\[ \Phi(\rho) \le \mathbb{1}_{\mathcal{X}} \otimes \Bigl( \gamma \xi + 2 \operatorname{Tr}_{\mathcal{X}}(\Pi \Phi(\rho) \Pi) + \frac{\mu}{M}\, \mathbb{1}_{\mathcal{W}} \Bigr), \]

which allows for a lower bound of $1/(\gamma + 2\varepsilon + \mu)$ for the primal objective function. For our choice
$\mu = 2^{-10}$ of an error bound, this quantity is still lower-bounded by 5/8, which implies that the
algorithm has operated correctly in this case.

A similar analysis to the one before holds for the case of rejection as well. We consider the 
operators 

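\[ \Pi_0/\beta_0, \ldots, \Pi_{T-1}/\beta_{T-1} \]

produced by the algorithm, and take

\[ Y = \frac{(1 + 2\varepsilon)(1 + 2\mu)}{T} \sum_{t=0}^{T-1} \Pi_t/\beta_t. \]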

When proving the dual feasibility of $Y$ we are no longer free to substitute $\rho_t = W_t/\operatorname{Tr}(W_t)$, but
instead we must introduce a small error term due to the fact that $\rho_t$ is just an approximation to
$W_t/\operatorname{Tr}(W_t)$. By the first inequality of (16) above we may conclude that

\[ \Bigl\langle \frac{W_t}{\operatorname{Tr}(W_t)}, \Phi^*(\Pi_t/\beta_t) \Bigr\rangle \ge 1 - \mu, \]

and by substituting this into (8) and following a similar argument to the one from before we
obtain

\[ \lambda_N(\Phi^*(Y)) \ge (1 + 2\varepsilon)(1 + 2\mu) \Bigl( (1 - \mu) \exp(-\varepsilon) - \frac{\varepsilon^2}{4} \Bigr) > 1. \]

Thus, dual feasibility holds for $Y$. Along similar lines, by using (15) and (16), one finds again
that the dual objective value achieved by $Y$ is less than 7/8, and therefore the algorithm operates
correctly in this case as well.